Approximating discrete probability distributions with Bayesian networks

نویسنده

  • Jon Williamson
چکیده

I generalise the arguments of [Chow & Liu 1968] to show that a Bayesian network satisfying some arbitrary constraint that best approximates a probability distribution is one for which mutual information weight is maximised. I give a practical procedure for finding an approximation network. The plan is first to discuss the approximation problem and its link with Bayesian network theory. After identifying the optimal approximation, the bulk of the paper will be devoted to a proposed approximation algorithm which sacrafices optimality in favour of practicality. We shall investigate the theoretical basis and performance of this approximation strategy. 1 THE APPROXIMATION PROBLEM It can be quite unfeasible to determine, store and manipulate a discrete probability distribution. Working with a distribution over binary variables, for instance, one must find and store at least probability values and it may still take too long to calculate the probability of a statement (or event) of interest. Consequently the question of whether one can approximate a distribution sufficiently accurately using a reasonable number of specifying values is of key importance. [Chow & Liu 1968] partially answered this question using an innovative and highly practical method. I shall briefly outline their answer in the language of Bayesian network theory. A Bayesian network is a directed acyclic graph (dag) over nodes , together with a set of probability values for each node literal (of the form where , the values that node can take) and each state (a conjunction of literals ) of the direct predecessors or parents of . Under an independence asNote that for each node the probability of one literal can be sumption, namely that any node is independent of its nondescendants conditional on its parents, a Bayesian network suffices to determine a probability distribution, since for each atomic state of the form we have that where is the literal and is the parent state of node consistent with (the factor is taken to be the unconditional probability if has no parents). Thus, depending on the connectivity in the graph, the Bayesian network representation of a probability function can dramatically reduce the number of probability values required to specify the function. I shall call the number of specifying probabilities the complexity of a Bayesian network. While a complete dag on binary-valued nodes requires maximum complexity in a Bayesian network, a tree-shaped dag only requires specifiers, a significant saving. Chow and Liu showed that a tree-based Bayesian network (with induced probability distribution ) that best approximates a target distribution (according to the cross-entropy measure of distance between distributions where the are the atomic states) is a maximum weight spanning tree where arcs between two nodes are weighted by their mutual information

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Networks as Coalgebras

A coalgebraic description of (discrete) Bayesian networks is presented. The coalgebra map representing a network sends a node to its set of predecessors, together with the associated conditional probability tables. We use this description to describe the semantics of a network in terms of various (discrete) probability distributions associated with a node: local, joint, and conditional distribu...

متن کامل

Inference in Hybrid Bayesian Networks Using Mixtures of Gaussians

The main goal of this paper is to describe a method for exact inference in general hybrid Bayesian networks (BNs) (with a mixture of discrete and continuous chance variables). Our method consists of approximating general hybrid Bayesian networks by a mixture of Gaussians (MoG) BNs. There exists a fast algorithm by Lauritzen-Jensen (LJ) for making exact inferences in MoG Bayesian networks, and t...

متن کامل

Determination of Maximum Bayesian Entropy Probability Distribution

In this paper, we consider the determination methods of maximum entropy multivariate distributions with given prior under the constraints, that the marginal distributions or the marginals and covariance matrix are prescribed. Next, some numerical solutions are considered for the cases of unavailable closed form of solutions. Finally, these methods are illustrated via some numerical examples.

متن کامل

Arrival probability in the stochastic networks with an established discrete time Markov chain

The probable lack of some arcs and nodes in the stochastic networks is considered in this paper, and its effect is shown as the arrival probability from a given source node to a given sink node. A discrete time Markov chain with an absorbing state is established in a directed acyclic network. Then, the probability of transition from the initial state to the absorbing state is computed. It is as...

متن کامل

On Approximating Multidimensional Probability Distributions by Compositional Models

Because of computational problems, multidimensional probability distributions must be approximated by distributions which can be defined by a reasonable number of parameters. As a rule, distributions with a special dependence structure (i.e., complying with a system of conditional independence relations) are considered; graphical Markov models and especially Bayesian networks are often used. Th...

متن کامل

A continuous approximation fitting to the discrete distributions using ODE

The probability density functions fitting to the discrete probability functions has always been needed, and very important. This paper is fitting the continuous curves which are probability density functions to the binomial probability functions, negative binomial geometrics, poisson and hypergeometric. The main key in these fittings is the use of the derivative concept and common differential ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003